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Abstract 


In  this  report,  multistage  decision  feedback  and  trellis-based  multiuser  receivers  are 
studied  for  convolutionally  coded  CDMA  links.  Some  of  the  receiver  structures  pro¬ 
posed  combine  the  functions  of  equalization  and  decoding  while  some  consider  the 
operations  separately.  A  variety  of  decision  feedback  approaches  are  proposed,  and 
through  simulations,  it  is  shown  that  these  approaches  are  able  to  significantly  outper¬ 
form  the  conventional  basestation  architecture.  Two  trellis-based  approaches  are  also 
proposed.  The  first  is  a  partitioned  structure  which  attacks  the  MUI  equalization  and 
decoding  operations  separately.  The  second  approach  is  the  maximum  likelihood 
sequence  estimator  for  this  problem  which  is  a  combined  equalizer  and  decoder.  This 
decoder  is  formulated  and  its  complexity  is  seen  to  depend  exponentially  on  the  number 
of  users  in  the  system,  the  number  of  states  in  the  individual  users’  encoders  and  the 
number  of  input  information  bits  per  interval. 


1.  Introduction 

In  code  division  multiple  access  (CDMA)  systems,  multiple  users  transmit  over  a 
common  communication  channel,  typically  using  the  direct  sequence  spread  spectrum 
technique.  The  receiver  operating  in  this  environment  receives  a  signal  which  is  the  sum 
of  all  of  the  transmitted  signals  in  noise,  and  the  receiver’s  job  is  to  reliably  decode  the 
signal  of  interest  from  this  received  composite  signal.  The  users  are  not  synchronized  in 
general,  and  in  addition,  the  received  signal  strengths  of  each  user  are  typically  unequal. 
Often,  in  an  attempt  to  improve  the  performance  of  each  link,  error  control  coding  will  be 
used  on  each  of  the  links  as  well.  The  receiver  structures  that  will  be  studied  in  this 
paper  may  be  for  a  base  station  in  a  cellular  telephone  cell  or  personal  communication 
network  (PCN)  cell,  or  it  may  be  for  one  of  the  user’s  receivers  in  a  decentralized  multi¬ 
ple  access  network  of  some  kind. 

The  traditional  method  of  coherently  demodulating  direct  sequence  CDMA  signals 
is  to  synchronize  a  local  code  generator  and  oscillator  to  the  signal  of  interest  and  then  to 
make  decisions  on  the  received  signal  as  though  the  desired  signal  is  the  only  one 
present.  The  received  signal  usually  consists  of  the  desired  signal,  a  multiuser  interfer¬ 
ence  (MUI)  signal,  thermal/shot  noise  and  may  be  further  degraded  by  dispersion.  The 
traditional  decoder’s  structure  is  that  of  a  correlator  or  matched  filter  which  is  matched  to 
the  desired  signal  followed  by  a  decoder  if  coding  is  used  on  the  fink. 

The  performance  of  the  traditional  decoder  suffers  for  two  major  reasons.  First,  the 
signature  sequences  of  the  different  users  will  not  be  orthogonal  to  each  other,  giving  rise 
to  the  MUI,  arid  second,  in  the  common  situation  where  all  of  the  signals  arriving  at  the 
receiver  are  of  different  strengths  the  strong  signals  tend  to  overwhelm  the  weak  signals, 
even  with  reasonably  good  signature  sequences.  This  second  problem  is  referred  to  as 
the  near— far  problem. 

A  major  improvement  over  the  traditional  receiver  can  be  achieved  by  viewing  the 
MUI  not  as  a  random  noise  signal,  but  instead  as  a  structured  interferer.  Because  all  of 
the  signals  making  up  the  MUI  in  a  CDMA  network  are  generally  of  the  same  structure 
as  the  signal  of  interest,  and  because  their  signature  sequences  are  generally  known,  it  is 
possible  to  augment  the  standard  receiver  structure  and  exploit  this  knowledge  of  the 
MUI  by  estimating  it  and  attempting  to  cancel  it,  or  by  jointly  estimating  the  entire  mes¬ 
sage.  The  augmentation  required  consists  of  additional  synchronization  circuitry  to  lock 
into  some  or  all  of  the  interfering  signals,  and  then  to  use  these  additional  statistics  to 
estimate  the  MUI. 
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Rgure  1:  Tree  diagram  of  the  possible  receiver  structures  for  CDMA  systems  operating  with  convolutional 
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Two  decoders  will  be  proposed,  a  partitioned  trellis-based  receiver,  which  treats  the 
equalization  and  decoding  operations  separately,  and  an  optimum  sequence  estimator, 
which  is  the  maximum  likelihood  sequence  estimation  receiver  for  this  problem.  The 
derivation  of  this  optimal  decoder  and  an  analysis  of  its  complexity  and  performance  will 
constitute  a  large  portion  of  this  report. 


2.  Notation 

It  will  be  assumed  that  the  CDMA  system  has  K  users  operating  on  the  same  fre¬ 
quency  in  an  asynchronous  fashion.  In  general,  it  will  be  assumed  that  each  user 
employs  coding  on  its  link.  We  will  confine  our  attention  to  a  particular  form  of  coding, 
namely  binary  convolutional  codes.  The  reason  for  this  confinement  of  the  scope  of  this 
work  is  primarily  out  of  convenience.  While  it  is  quite  conceivable  that  block  codes 
could  be  used  effectively  on  a  CDMA  link,  convolutional  codes  have  the  advantage  that 
they  are  stream  codes,  or  operate  in  a  sequential  fashion.  Because  the  decoders  that  will 
be  studied  in  this  work  are  sequential  in  nature,  the  convolutional  codes  are  a  much  better 
match  to  the  decoders  than  block  codes.  Also,  in  [16]  it  was  shown  that  in  CDMA  sys¬ 
tems,  binary  convolutional  codes  often  outperform  more  general  trellis  codes  which  map 
information  symbols  onto  M-level  signals  where  M  is  larger  than  the  alphabet  size  of  the 
information  symbols.  In  other  words,  there  is  no  particular  advantage  to  using  nonbinary 
signals.  This  may  be  considered  a  further  justification  for  the  confinement  in  scope  of  this 
work  to  binary  convolutional  codes. 

At  each  time  interval,  the  convolutional  code  is  generated  for  user  k  by  passing  P 
binary  information  bits,  hU)  =  (40‘).->40-^+l)).  through  a  shift  register  consisting 
of  W  (P  bit)  stages  and  Q  linear  function  generators,  as  shown  in  Figure  2.  The  input  bits 
are  shifted  through  the  shift  register  P  bits  at  a  time,  and  the  number  of  output  bits  for 
each  P-bit  input  sequence  is  Q  bits.  The  rate  of  the  code  is  —  P  /Q  and  the  constraint 
length  of  the  code  is  W.  The  output  sequence  of  code  bits  for  the  interval  corresponding 
to  input  bits  7*0')  is  (D*(n),...,D*(n-Q+l))-  This  output  bit  sequence  is  comprised  of 
binary  bits,  as  the  linear  function  generators  creating  them  will  be  assumed  to  be 
modulo-2  adders.  Note  that  for  W  =  1  and  P  =  g  =  1,  we  have  the  uncoded  case,  so  in 
that  case  D*(n)  =  hin). 

In  the  time  interval  [/jr-K*,(n +1)7+1*),  user  k  transmits  data  bit  D*(n),  where  x* 
represents  the  time  shift  of  the  Jt'*  user  relative  to  some  reference  time,  thus  accounting 
for  the  asynchronism  of  the  users  relative  to  each  other.  T  represents  the  time  for  one 
code  bit  duration  and  Ti,  ~  QT/P  is  the  information  bit  duration. 


encoded  sequence  to  modulator 


Figure  2  General  convolutional  encoder  structure.  [26] 


Figure  3  Convolutionally  coded  direct  sequence  CDMA  link, 
with  a  conventional  basestation  architecture,  (case  Ej  =  1  shown) 
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Each  user  in  the  system  is  assigned  a  particular  signature  sequence,  and  it  will  be 
assumed  that  this  signature  sequence  has  a  duration  equal  to  the  code  bit  interval, 
although  this  assumption  can  be  relaxed  with  a  change  of  the  notation.  We  will  combine 
the  carrier  and  signature  sequence  into  a  single  signal,  thus  the  carrier  multiplied  by 
the  binary  signature  sequence,  PN^it),  will  be  denoted  by 

Skit—nT -^k)  —  ^2/T  PNkit—nT  —Xk)  cos  {cUcit—nT—Xky)  itT-i^k  ^  t  ^  (n+l)7’+Xfc  (1) 

and  it  will  have  a  time  support  limited  to  the  time  interval  [nr+Tjt,(n+l)7’+Tjfc).  The 
energy  of  the  it**  user’s  signal  measured  at  the  receiver  will  be  denoted  by  Ek.  It  will  be 
assumed  that  all  K  users  transmit  their  signals  through  a  common  additive  white  Gaus¬ 
sian  noise  channel,  and  so  the  received  signal  will  have  the  following  form: 

rit)=  X  j:Dkii)^^Ei^ Skit-iT-T;k)  +  riit)  .  (2) 

Next  we  define  the  partial  cross-correlation  of  signature  sequences  j  and  k  to  be: 

ee 

oo 

It  is  worth  noting  that  pyy(0)=  1  and  Pjk(0  =  Pkji~0- 

In  section  4,  we  will  derive  a  front  end  to  the  receiver  which  is  appropriate  for  the 
maximum  likelihood  sequence  estimator  in  that  section,  however,  for,  now  we  assume 
that  the  front  end  to  the  receiver  consists  of  a  bank  of  K  matched  filters,  each  matched  to 
one  of  the  transmitted  waveforms  in  the  system.  It  was  shown  in  [1]  that  the  complete 
set  of  matched  filter  outputs  generates  sufficient  statistics  for  the  demodulation  of  each 
user’s  data.  The  output  of  the  filter  matched  to  the  k'*  signal  at  time  (i +l)r -K^  is 

rkiO=  J  r{t)skif-iT-<k)dt  (4) 

iT+r* 

where  perfect  synchronization  has  been  assumed  here  between  the  A:**  component  of  the 
received  signal  and  the  local  signature  sequence  generator  at  the  receiver.  Substituting 
for  r(r)  in  equation  (2)  and  integrating,  we  obtain 

rjt(0=  X  +  +  (5) 

n=ifc+l  rt=l  n=l 

The  base  station  that  will  be  referred  to  as  the  conventional  base  station  on  a  coded 
link  is  illustrated  in  Figure  3.  In  this  figure,  the  component  of  the  basestation  which  is 
attempting  to  estimate  the  7'*  user’s  data  operates  with  only  the  matched  filter  outputs  for 
the  7'*  user.  It  will  be  assumed,  as  is  shown  in  Figure  3,  that  the  Viterbi  algorithm 
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operating  on  each  channel’s  observed  code  symbols  is  a  soft-Viterbi  algorithm  operating 
with  a  decoding  delay  of  8  information  symbols.  Note  that  generally  5  will  be  several 
times  the  constraint  length,  W,  of  the  code. 

As  in  the  uncoded  case,  there  are  a  number  of  ways  that  a  joint  processor  can 
operate  to  improve  upon  the  performance  of  a  conventional  basestation.  To  this  point, 
there  has  been  some  work  on  the  analysis  of  the  conventional  basestation  for  the  coded 
Unk  problem,  [16],  but  very  litfle  has  been  reported  in  the  literature  on  the  design  of  joint 
processors  for  coded  links. 

In  [18]  a  sliding  window  version  of  the  decorrelating  detector,  which  was  the  linear 
decoder  first  proposed  in  [4],  was  studied  and  the  authors  alluded  to  the  use  of  a  convolu¬ 
tional  code  on  the  link.  In  that  approach,  a  linear  detector,  whose  decision  statistic  is  a 
linear  combination  of  the  matched  filter  outputs,  is  used  to  equalize  the  MUI  prior  to 
decoding  with  the  Viterbi  algorithms.  Results  of  this  approach  are  not  given,  although 
they  will  undoubtably  appear  in  a  subsequent  article. 

Because  the  decision  feedback  approaches  proved  to  be  an  attractive  alternative  in 
the  uncoded  case,  (see  [2],  [3],  [11]  and  [21])  due  to  their  reasonable  complexity  and 
good  performance,  it  is  natural  to  begin  by  extending  the  decision  feedback  approaches 
to  coded  links.  We  will  use  the  term  decision  feedback  decoders  in  reference  to  the  class 
of  multistage  decoders  for  the  MUI  equalization  problem.  This  approach  will  be  exam¬ 
ined  in  section  3  of  this  report. 

An  obvious  question  with  any  system  we  wish  to  analyze  is,  "What  is  the  optimal 
decoder’s  structure  and  performance?"  Because  the  optimal  sequence  estimator  for 
equalizing  an  uncoded  CDMA  system  is  a  Viterbi  algorithm,  and  because  the  Viterbi 
algorithm  is  also  the  optimal  decoder  for  a  single  convolutionally  coded  link,  it  is  reason¬ 
able  to  expect  that  the  optimal  sequence  estimator  for  the  coded  link  will  also  be  a 
Viterbi  algorithm  operating  on  an  appropriate  trellis.  It  follows  then,  that  trellis-based 
decoding  techniques  will  be  an  appropriate  and  interesting  class  of  decoders  to  study.  In 
section  4,  both  a  partitioned  trellis-based  approach  and  the  optimum  maximum  likelihood 
sequence  estimator  will  be  derived.  It  will  be  shown  that  these  decoders  have  an 
exponential  complexity  growth  with  K  and  W,  and  so  this  motivates  our  search  for  an 
effective  suboptimum  solution  in  the  next  section. 

3.  Decision  Feedback  Decoders  for  Coded  Links 

The  most  naive  way  in  which  decision  feedback  can  be  applied  to  a  coded  link  is 
illustrated  in  Figure  4a.  In  this  approach,  termed  the  hard  code  symbol  DFE  approach. 


Rgure  4a  Hard  code  symbol  DFE  approach  using  a  7-stage  DFE  on  the  code  symbols. 
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Figure  4b  Soft  code  symbol  Dra  approach  using  a  7-stage  DFE  on  the  code  symbols  with 
hard  decision  in  the  final  stage. 


Figure  4c  Integrated  DFE  with  Viterbi  algorithms  operating  at  each  stage  of  the  DFE. 
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a  DFE  of  some  type  jointly  estimates  the  code  bits  and  then  feeds  the  results  to  a  bank  of 
hard-decision  Viterbi  decoders  which  use  a  Hamming  distance  metric. 

A  second  approach,  which  will  be  called  the  soft  code  symbol  DFE  approach 
(SCS-DFE),  is  illustrated  in  Figure  4b.  In  this  approach,  again  a  multistage  DFE  operates 
on  the  set  of  matched  filter  outputs,  by  making  tentative  decisions  and  feeding  these  deci¬ 
sions  back  to  make  estimates  of  the  MUI  which  will  be  subtracted  from  other  matched 
filter  outputs.  The  key  difference  in  the  SCS-DFE  is  that  at  the  final  stage,  the  MUI  esti¬ 
mate  will  again  be  subtracted  from  the  delayed  matched  filter  output,  but  no  hard- 
decision  making  will  be  performed.  Instead,  the  modified  matched  filter  output  will  be 
passed  straight  to  the  Viterbi  decoder.  This  Viterbi  decoder  will  be  a  soft-decision 
decoder  and  its  metric  will  be  a  correlation-style  metric  as  opposed  to  the  Hamming  dis¬ 
tance  metric  of  the  hard-decision  Viterbi  decoder. 

At  this  point,  we  have  not  committed  to  a  particular  type  of  multistage  DFE.  There 
have  been  at  least  three  architectures  proposed  in  the  literature  for  asynchronous  CDMA 
links,  [2],  [3],  and  [11],  and  there  has  also  been  some  work  on  improving  the  decision 
making  procedure  of  the  algorithms,  [21].  In  [21],  techniques  such  as  the  use  of  a 
decorrelator  as  the  first  stage  decision  maker,  or  a  linear  clipper  or  both  were  shown  to 
provide  gains  in  terms  of  the  asymptotic  multiuser  efficiency  for  the  two-user  case.  The 
improved  decision  making  procedures  in  [21]  can  easily  be  applied  to  any  of  the  three 
basic  architectures.  In  this  report,  we  will  refer  to  the  architecture  in  [2]  as  the  Varanasi 
version,  the  architecture  in  [3]  as  the  Xie  version,  and  the  architecture  in  [11]  as  the 
Hybrid  version  in  keeping  with  the  names  used  in  [11]. 

All  of  these  code  symbol  DFE  approaches  operate  at  the  code  symbol  level  as 
though  there  was  no  coding  on  the  link,  and  then  pass  their  decisions  or  improved  statis¬ 
tics  to  an  outer  decoder.  The  problem  with  this  approach  is  that  the  energy  in  each  code 
symbol  is  less  than  that  of  each  of  the  information  symbols  by  a  factor  of  7?^,  the  code 
rate.  Furthermore,  separating  the  functions  of  cancelling  the  MUI  and  decoding  the  mes¬ 
sage  does  not  take  full  advantage  of  the  coding  on  the  link.  The  following  approach 
attempts  to  alleviate  these  shortcomings. 

The  decision  feedback  equalizer  illustrated  schematically  in  Figure  4c,  and  more 
explicitly  in  Figure  5,  will  be  called  the  integrated  DFE.  The  idea  in  the  integrated  DFE 
is  that  the  MUI  can  be  more  reliably  estimated  by  exploiting  the  coding.  In  other  words, 
instead  of  using  a  hard-decision  device  in  the  first  stage  of  the  DFE,  we  may  decode  the 
message  using  a  soft-decision  Viterbi  algorithm  operating  on  the  stream  of  matched  filter 
outputs  in  the  channel,  and  then  re-encode  the  decoded  bits  to  form  estimates  of  the 
code  bits.  Due  to  the  delay  in  the  Viterbi  decoder,  the  delay  between  stages  and  the 


Figure  5  The  structure  of  a  3-stage  integrated  DFE  for  the  it**  user  with  Viterbi  algorithms  (denoted  VA) 
at  each  stage.  A  is  the  maximum  delay  in  code  bit  periods  corresponding  to  6  information  bit  periods 
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buffering  operations  will  be  greater  than  in  a  code  symbol  DFE.  In  this  DFE  approach,  at 
each  stage,  Viterbi  decoders  will  replace  the  hard-decision  comparators  that  were  present 
in  the  code  symbol  multistage  DFEs.  Again,  as  in  the  uncoded  case,  the  decision  feed¬ 
back  may  be  performed  for  as  many  stages  as  is  desired. 

A  characteristic  of  convolutional  codes,  or  most  codes  for  that  matter,  is  that  at  very 
low  signal  to  noise  ratios,  the  coded  link  may  perform  worse  than  an  uncoded  link. 
When  the  signal  to  noise  ratio  is  in  this  regime,  it  is  possible  that  the  Viterbi  decoder 
whose  outputs  are  re-encoded  to  form  the  ML  estimate  of  the  code  bit  sequence,  may 
perform  worse  than  a  simple  hard-decision  device  operating  on  the  code  bits  without 
regard  to  the  coding.  As  a  result  of  this  characteristic,  it  is  important  that  the  combina¬ 
tion  of  the  thermal  noise  and  MUI  is  not  so  strong  that  the  re-encoded  Viterbi  output 
sequence  is  worse  than  the  estimated  code  bits  of  a  simple  threshold  detector  for  the 
integrated  DFE  to  outperform  an  SCS-DFE.  Basically,  the  structure  which  provides 
better  estimates  of  the  code  bit  sequence  will  have  a  better  estimate  of  the  MUI  in  the 
other  channel’s  multistage  decoders.  In  general,  because  coding  generally  allows  better 
estimates  of  the  transmitted  sequence,  it  is  reasonable  to  expect  the  integrated  DFE  to 
outperform  an  SCS-DFE  of  a  similar  architecture. 

The  performance  of  this  class  of  approaches  is  difficult  to  evaluate  analytically, 
especially  for  more  than  two  users,  as  in  the  uncoded  case,  [2],  [3],  and  [11].  As  a  result, 
simulation  will  be  the  performance  evaluation  technique.  Because  the  structure  of  the 
integrated  DFE  is  most  like  the  Varanasi  style  uncoded  link  multistage  decoder,  it  is 
interesting  to  compare  the  integrated  DFE  with  the  Varanasi  style  SCS-DFE.  In  addition, 
because  it  was  shown  in  [11]  that  in  most  cases,  the  Hybrid  DFE  outperforms  the  other 
two  architectures  on  an  uncoded  link,  it  is  an  obvious  candidate  for  use  in  an  SCS-DFE 
structure.  Thus,  the  structures  that  were  simulated  were  the  Hybrid  and  Varanasi  ver¬ 
sions  of  the  SCS-DFE  and  the  integrated  DFE.  The  modifications  to  the  decision  making 
devices  in  each  preliminary  stage  of  the  multistage  decoders  discussed  in  [21]  were  not 
considered  here,  although  those  modifications  may  provide  improvements  in  some  cases. 

Figure  6  shows  two  sets  of  signature  sequences  for  a  four-user  system  (the  complex 
envelopes  of  the  signals  are  shown  assuming  a  carrier  phase  of  zero).  The  resulting 
crosscorrelation  structure  is  also  shown  in  this  figure.  There  are  many  signature  sets  and 
delays,  in  general,  which  yield  the  same  crosscorrelation  structure,  and  so  the  signature 
sets  which  are  shown  are  provided  only  to  illustrate  that  the  set  of  crosscorrelations  simu¬ 
lated  are  achievable.  (It  is  possible  to  construct  correlation  matrices  which  are  not  in  the 
range  of  achievable  values.)  Both  signature  sets  shown  are  constructed  to  provide  a  high 
level  of  MUI  which  is  fairly  balanced  between  the  users.  This  choice  of  signature 
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Rgure  6a  An  example  set  of  signature  waveforms  and  delays  which  yield  the  correlation  structure  shown. 
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Figure  6b  A  second  set  of  signature  waveforms  and  delays  which  yield  the  more  severe  correlation  structure  shown. 
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waveforms  does  not  necessarily  represent  an  intelligent  choice  of  waveforms  for  this  par¬ 
ticular  chip-to-symbol  rate  ratio.  It  is  also  probably  safe  to  say  that  if  the  MUI  was  much 
more  severe  than  that  simulated  in  these  examples,  then  the  dominating  problem  would 
most  likely  be  that  of  the  initial  acquisition  of  the  local  oscillators  and  code  sequence 
generators  at  the  front  end  of  the  receiver  rather  than  the  performance  of  the  multiuser 
detectors  if  synchronization  is  achieved. 

Figure  7  shows  performance  on  the  "0.2  channel"  illustrated  by  Figure  6a.  As  Fig¬ 
ure  7  illustrates,  the  conventional  decoder  suffers  about  a  3  dB  loss  at 
Pb  average  =2’10~^  relative  to  the  performance  of  the  same  receiver  operating  in  the 
absence  of  MUI.  In  this  environment,  the  integrated  DFE  is  able  to  nearly  recoup  all  of 
this  loss,  while  the  various  SCS-DFEs  are  able  to  only  recoup  some  of  the  loss.  It  may 
thus  be  concluded  that  the  thermal  noise  and  MUI  are  weak  enough  to  be  operating  in  the 
regime  where  a  receiver  which  exploits  the  coding  performs  better  than  one  which  does 
not.  The  Hybrid  version  of  the  SCS-DFE  outperforms  the  Varanasi  version  of  the  SCS- 
DFE  which  is  similar  to  the  results  obtained  on  the  uncoded  link  simulated  in  [11]. 

Figure  8  shows  the  performance  on  the  more  severe  channel  illustrated  in  Figure  6b. 
In  this  figure,  it  is  evident  that  all  of  the  decoders  perform  significantly  worse  than  in  the 
"0.2  channel"  due  to  the  more  severe  MUI.  The  integrated  DFE  still  uniformly  outper¬ 
forms  the  Varanasi  version  of  the  SCS-DFE.  For  these  particular  channel  characteristics, 
however,  the  hybrid  SCS-DFE  is  able  to  outperform  the  integrated  DFE  at  larger  values 
of  Eb/No.  While  this  may  seem  surprizing  at  first,  it  is  simply  due  to  the  fact  that  even 
though  the  hybrid  SCS-DFE  performs  separate  equalization  and  decoding,  it  has  a  high 
quality  first  stage  which  is  able  to  provide  better  code  symbol  estimates  to  the  second 
stage  MUI  estimator  than  the  conventional  Viterbi  algorithm  operating  in  the  first  stage 
of  the  integrated  DFE.  This  case  illustrates  that  when  the  MUI  is  strong  enough,  the 
integrated  DFE  will  not  always  outperform  a  well  designed  SCS-DFE,  although  it  does  in 
most  cases. 

An  additional  key  consideration  in  the  choice  of  the  receiver  structure  is  complex¬ 
ity.  The  relative  complexities  of  the  various  approaches  may  be  estimated  by  considering 
the  time  complexity  per  bit.  [1]  This  measure  is,  roughly  speaking,  the  number  of 
"metrics"  which  must  be  computed  for  each  decided  bit.  In  the  DFE  structures  con¬ 
sidered  here,  metrics  are  formally  being  computed  in  the  Viterbi  decoders,  and  there  is 
some  additional  complexity  associated  with  the  MUI  estimation  procedure  in  the  DFE 
part  of  the  receivers.  In  general,  the  computation  of  the  MUI  in  each  stage  of  the  DFE 
structures  is  roughly  equivalent  in  complexity  to  the  computation  of  one  metric  in  the 
Viterbi  decoder.  Thus  adopting  this  convention,  we  may  conclude  that  a  7-stage  SCS- 


Figure  7  Performajice  curves  of  the  various  decision  feedback  receivers  for  the  4-user 
0.2  channel  illustrated  in  Figure  6a.  The  solid  lines  show  a  single  user  system  (no  MUI) 
with  and  without  the  rate- 1/2  4-state  convolutional  code.  Also  shown  are  the  one,  two 
and  three  stage  soft  code  symbol  DFEs  for  both  the  Varanasi  (dashed)  and  Hybrid  (dot¬ 
ted)  architectures,  and  a  one,  two  and  three  stage  integrated  DFE  (dashed  lines).  Note 
that  the  Varanasi  style  one-stage  soft  code  symbol  DFE  and  the  one-stage  integrated  DFE 
are  both  equivalent  to  the  conventional  receiver. 


Figure  8  Performance  curves  of  the  various  decision  feedback  receivers  for  the  more 
severe  4-user  0.25  channel  illustrated  in  Figure  6b.  The  solid  lines  show  a  single  user 
system  (no  MUI)  with  and  without  the  rate- 1/2  4-state  convolutional  code.  Also  shown 
are  the  one,  two  and  three  stage  soft  code  symbol  DFEs  of  both  the  Varanasi  (dashed) 
and  Hybrid  (dotted)  architectures,  and  a  one,  two  and  three  stage  integrated  DFE  (dashed 
lines).  Note  that  the  Varanasi  style  one-stage  soft  code  symbol  DFE  and  the  one-stage 
integrated  DFE  are  both  equivalent  to  the  conventional  receiver. 
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DFE  operating  on  a  link  with  rate  \/Q  codes,  for  example,  has  a  time  complexity  per 
decoded  bit  of  roughly  TCB  =  O  where  W  is  the  constraint  length  of  the  code 

used.  In  contrast,  the  7-stage  integrated  DFE  has  a  time  complexity  of  roughly  TCB  = 
Oi(J-l)Q+J2^),  which  is  significantly  higher  than  that  of  the  SCS-DFE,  although  it  is 
far  less  than  that  of  the  trellis-based  approaches  which  will  be  studied  in  section  4  of  this 
report. 

In  the  general  rate  P/Q  code  case,  if  k  =  log25  where  S  is  the  number  of  states  in 
each  user’s  encoder,  then  k  represents  the  total  binary  memory  order  of  each  user’s 
encoder.  Using  this  definition,  it  is  not  difficult  to  determine  the  TCB  for  this  general 
case.  The  SCS-DFE  has  a  TCB  =  0  ([JQ  +  2'^^^yP)  and  the  integrated  DFE  has  TCB  = 
0([(J-l)Q+J2'^^^yP). 

Another  consideration  in  the  choice  of  receiver  structures  may  be  the  delay  between 
the  observation  of  the  matched  filter  output  corresponding  to  a  particular  information  bit 
and  the  ultimate  estimation  of  that  bit.  If  it  is  assumed  that  the  Viterbi  decoders  used 
operate  with  a  decoding  delay  of  5  information  symbols,  which  is  typically  on  the  order 
of  5W,  then  the  overall  decoding  delay  of  the  7-stage  SCS-DFE  is  (J-lYQ  +  d-  J  +  5W. 
The  co*iresponding  decoding  delay  of  the  7-stage  integrated  DFE  is  roughly  75  =  57W. 

We  may  thus  conclude  that  the  integrated  DFE  has  a  larger  TCB,  and  a  longer 
decoding  delay  than  any  of  the  SCS-DFE  approaches,  but  its  performance  is  better  in 
most  cases.  As  a  result,  the  appropriate  choice  of  receiver  configuration  will  depend  on 
the  expected  severity  of  the  channel  and  the  complexity  and  delay  constraints  on  the 
receiver.  As  we  have  seen  in  Figure  8,  however,  even  the  integrated  DFE  does  not  per¬ 
form  well  when  the  MUI  becomes  too  strong.  As  a  result,  it  is  natural  to  look  for  nearly 
optimal  and  optimal  receiver  structures  which  do  perform  well  even  in  these  severe 
cases.  The  other  motivation  for  looking  to  the  nearly  optimal  and  optimal  schemes  is  to 
provide  a  benchmark  for  comparison  with  the  suboptimal  DFE  schemes  in  all  cases.  This 
will  allow  us  to  see  how  close  the  suboptimal  approaches  are  performing  to  the  optimal 
approaches.  These  nearly  optimal  and  optimal  schemes  are  the  subject  of  the  next  sec¬ 
tion. 

4.  Trellis-Based  Receivers 

The  standard  decoder  for  a  convolutional  code  is  the  Viterbi  algorithm  operating  on 
the  encoder’s  trellis  because  it  allows  maximum  likelihood  sequence  estimation  with  a 
modest  decoder  complexity  when  the  constraint  length  of  the  code  is  small,  [1/].  The 
optimal  joint  decoder  for  an  asynchronous  CDMA  system  is  also  a  Viterbi  algorithm 
operating  on  a  time- varying  trellis  with  2^”*  states,  [1].  The  simplest  trellis-based 
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approach  is  therefore  the  decoder  shown  in  Figure  9a,  which  will  be  referred  to  as  the 
partitioned  trellis-based  solution.  In  this  approach,  an  optimal  equalizer  for  the  MUI 
operates  at  the  code  symbol  level  and  feeds  the  resulting  hard  code  symbol  estimates  to  a 
bank  of  Viterbi  algorithms  which  operate  independently  of  each  other.  These  Viterbi 
algorithms  decode  the  ML  information  sequence  for  each  respective  user  given  the  deci¬ 
sions  from  the  MUI  equalizer. 

The  time  complexity  per  decoded  bit  for  this  receiver  may  be  estimated  again  by 
considering  the  number  of  metric  computations  per  information  bit  decided.  We  again 
consider  first  the  rate  \/Q  code  example.  The  optimal  equalizer  computes  2  metrics  for 
every  bit  that  is  decided  and  there  are  '2^  metrics  computed  in  the  decoding  Viterbi  algo¬ 
rithm  for  every  bit  decided,  so  the  total  TCB  =  0(2  2*^ +2'^).  In  the  general  rate  P/Q 
code  case,  TCB  =  0  ([2  2^ + 

Another  variation  of  the  partitioned  trellis-based  decoder  would  be  to  modify  the 
equalizer  Viterbi  algorithm  to  supply  soft-decision  outputs  to  the  outer  decoding  Viterbi 
algorithms.  This  could  easily  be  accomplished  using  the  techniques  described  in  [29]. 
This  idea  is  the  trellis-based  analog  of  the  SCS-DFE  and  it  is  reasonable  to  expect  that  its 
performance  would  be  slightly  better  than  the  standard  partitioned  trellis-based  receiver 
in  most  cases. 

One  major  limitation  of  the  partitioned  trellis-based  decoder  approach,  of  the  soft  or 
hard-decision  variety,  is  that  the  equalization  of  the  MUI  will  have  to  take  place  at  the 
code  symbol  level.  Because  the  code  symbols  have  a  smaller  energy  per  symbol  than  the 
information  symbols,  it  is  reasonable  to  expect  that  the  decoder  which  takes  the  coding 
into  account  and  effectively  operates  at  the  information  symbol  level  can  attain  a  higher 
performance  leveL  This  idea  is  the  trellis-based  analog  of  the  integrated  DFE.  Figure  9b 
illustrates  the  optimal  maximum  likelihood  sequence  estimator,  and  in  sections  4. 1  and 
4.2,  this  decoder  will  be  derived. 

4.1  Optimum  Sequence  Estimator  For  Rate-1/2  Convolutional  Codes 

In  this  section,  the  optimal  MLSE  will  be  derived  for  the  special  case  in  which  each 
user  in  the  network  is  employing  a  rate- 1/2  convolutional  code  with  a  constraint  length  of 
W.  Our  limitation  to  this  special  case  will  facilitate  the  derivation  of  the  decoder  consid¬ 
erably,  and  it  will  be  outlined  in  section  4.2  how  the  optimal  decoder  can  be  derived  in  a 
similar  fashion  for  a  general  rate-P/2  convolutional  code  case. 

To  begin,  it  is  important  to  note  that  the  optimal  sequence  estimator  or  equalizer  for 
multiple-user  uncoded  signals  operates  in  a  round-robin  fashion  among  all  K  users  in  the 
system,  [1].  This  Viterbi  algorithm  traverses  one  trellis  stage  per  channel  bit  observed. 


andAWGN 


Rgure  10  Trellis  code  view  of  a  rate- 1/2  convolutional  encoder  with  BPSK  modulation  for  user  k  at  time  i. 
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The  optimal  sequence  estimator  for  decoding  the  rate- 1/2  code  for  one  of  the  users  in  a 
single  user  environment,  however,  is  a  Viterbi  algorithm  which  requires  two  channel 
observations  from  the  user  of  interest  to  move  ahead  one  stage  in  the  trellis.  The  fact  that 
the  equalizer  and  decoder  operate  in  a  fundamentally  different  way  suggests  that  a 
slightly  different  view  of  the  problem  is  required.  The  following  view  of  the  problem 
will  be  adopted  in  order  to  bypass  this  issue.  The  rate- 1/2  convolutional  code  can  be 
viewed  not  as  a  code  which  produces  two  binary  bits  per  information  bit  period,  Tb,  but 
as  an  equivalent  trellis  code  which  produces  one  4-ary  coded  waveform  every  T^,.  We 
will  reorient  ourselves  with  respect  to  these  super-symbols  which  occupy  an  information 
bit  period,  and  then  examine  the  effect  of  MUI  on  these  super-symbols.  By  then  formu¬ 
lating  the  equalization  problem  at  the  receiver  with  respect  to  the  super-symbol  view  of 
the  received  signal,  we  can  accomplish  both  the  tasks  of  equalization  and  decoding  in  the 
same  Viterbi  algorithm.  Because  there  is  only  one  super-symbol  received  for  each  infor¬ 
mation  bit  that  must  be  decided,  the  decoder  can  be  formulated  in  the  same  fashion  as 
was  used  in  [1]  for  the  MUI  problem  and  [19]  for  the  ISI  problem. 

We  begin  by  defining  the  following  functions. 
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If  the  outputs  of  the  first  and  second  modulo-2  adders  in  the  convolutional  encoder  are 
given  respectively  by  D  Ijfc(i)  and  D2jfe(i)  for  the  information  symbol  interval  and  the 
it**  user  (see  Figure  10),  then  the  output  of  the  equivalent  trellis  code  can  be  defined  with 
the  following  mapping  rule. 
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Now,  the  following  function  may  be  defined 

Dkit-iTb-Qk)  =  Re  [dkd)  f  (t-iTb-Qk)  ]  (10) 

where  Ojk  =4  if  the  code  is  synchronized  such  that  D  4(t)  is  sent  in  the  time  interval 
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rE[/Ti,-K*,(j +‘/2)rfc-Kfc).  On  the  other  hand,  if  the  code  is  synchronized  such  that  D 
is  sent  in  the  time  interval  re[(i+‘/2)r/,-Kjk,(i+l)Ti,+tjfc)  then  0*  sTj^-l-r.  This  parameter 
Gjk  represents  the  time  shift  of  the  user’s  super-code  symbols  relative  to  the  same 
reference  time  that  was  used  to  define  the  set  {Xjt}  f=i.  It  is  also  worth  noting  that  with 
the  definition  of  {0*}  f=i,  the  users  are  no  longer  ordered  such  that  their  delay  increases 
with  it  as  we  assumed  when  we  were  working  at  the  code  bit  level.  It  is  convenient  to 
assume  at  this  point  that  the  users  are  reindexed  so  that  they  once  again  have  an  increas¬ 
ing  delay  with  k. 

Equation  (10)  may  be  rewritten  as 

Dkit-iTb-^k)  =  Re  [4(0]  Mt-iTb-Qk)  “  [4(01  fi  (t  -iTb-Qk)  (1 0 

by  substituting  equation  (8)  into  (10).  Note  that  Dkit)£.{  ±  fo(t),  ±  /i(0  }•  This 
waveform  has  a  duration  of  Tb  =  2T.  Next,  two  of  the  signature-carrier  waveforms  are 
scaled  and  concatenated  to  form  a  super-signature  waveform. 

hit-iTb-Qk)  =  '^M-iTb-ek)  +  Skit-(i+^/i)Tb-^^^  (12) 

(Recall  that  s^it)  had  support  on  [0,7). )  The  received  waveform  may  now  be  written  in 
terms  of  these  waveforms  of  duration  7),: 

r(0=  X  I,^kit-iTb-Qk)h(t-iTb-Qk)^  +  n(t)  (13) 

i=-«jt=i 

where  =  V2£jfc  -  This  may  be  explicitly  written  in  terms  of  the  complex  super-code 
symbols,  4(0.  by  substituting  (1 1)  into  (13). 

r(0=  X  ^Re[dk(.i)]Mt-iTb-Qk)hit-iTb-^k)^ 

i=>-oo/:=l 

-Im[dk(i)]fiit-iTb-Qk)Mt-iTb-Qk)^+n(.t)  04) 

Figure  11  illustrates  a  two-state  rate- 1/2  convolutional  code  and  its  equivalent  trellis 
representation.  It  is  worth  noting  that  the  signal  that  results  from  two  code  symbol  inter¬ 
vals  in  a  rate- 1/2  convolutional  code  is  now  modeled  as  a  signal  resulting  from  a  rate-1/2 
trellis  code  with  one  complex  super-code  symbol  modulating  a  pair  of  orthonormal  basis 
functions  through  the  procedure  defined  above.  The  basis  functions  in  this  new  view  ot 
the  waveform  are 

il>ot«)  =  /o«)si(0  (15) 

and 


<l>iifc(0=/i(04(0 


(16) 


0-^1 

l-»-l 


Figure  11a  Rate- 1/2  convolutional  encoder  and  its  equivalent  trellis  representation. 


Figure  1  lb  Trellis  diagrams  and  signal  space  diagrams  for  each  view  of  the  code  from  Figure  9a. 
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appropriately  synchronized  with  the  information  bit  periods.  Thus,  this  view  of  the  cod¬ 
ing  process  suggests  that  the  information  bits  are  mapped  by  the  encoder  onto  waveforms 
in  a  space  defined  by  <t>ojfc(0  and  (|)u-(0-  The  resulting  signal  is  equivalent  to  that  emerg¬ 
ing  from  the  convolutional  encoder  and  modulator,  and  furthermore,  the  performance  of 
a  Viterbi  algorithm  operating  in  the  space  defined  by  ^okiO  and  (t)u(r)  will  be  identical  to 
that  of  a  Viterbi  algorithm  operating  with  two  observations  from  a  space  defined  by  the 
basis  function  Skit). 

Taking  the  trellis  viewpoint  of  the  coder,  we  note  that  although  the  basis  functions 
defined  in  (15)  and  (16)  are  orthonormal,  they  are  not,  in  general,  orthogonal  to 
and  <t>iy(t)  which  are  the  basis  set  for  another  user  in  the  system,  user  j,  since  Skit)  and 
Sj(t)  are  not  orthogonal  in  general.  The  result  when  the  received  signal  is  a  sum  of  K 
component  signals  is  MUI. 

We  now  define  four  parameters  which  are  a  measure  of  the  degree  of  correlation 
between  the  basis  functions  of  the  different  users. 


Pyjfc(0=  J 

(17) 

oo 

Ujk(l)=  j^ijit-ej)Mt-lTb-Qk)dt 

1"  oo 

(18) 

eo 

j  ^Oj(t-^j)^lk(i-iTb-Qk)dt 

(19) 

eo 

(20) 

These  parameters  play  the  same  role  in  the  super-symbol  view  of  the  coded  signal  that 
Pjk(l)  plays  for  the  standard  view  of  the  signal.  In  fact,  p,  U,  V  and  W  can  be  related  to  p 
directly  by  substituting  (12)  into  (17)  through  (20),  as  long  as  the  relationships  between 
Qj  and  Xj  as  well  as  6k  ®ttd  Xk  are  known.  If,  for  example,  either  6j=Xj  and  0;^  ='^k  or 
6j  =Xj+T and  6k=Xk+T then  the  following  holds: 

P;*(/)  =  P;*(2/)+  V2pjk(2l+1)+  V2pjk(2l-l) 

(21a) 

Ujkd)  =  Pjk(2l)  -  V2pjk(2l-l)-V2pjki2l+l) 

(21b) 

Vjk(l)  =  V2pjki2l-l)-V2pjk(2l+l) 

(21c) 

Wjk(l)  =  V2Pjk(2l+l)-V2pjk(2l-l) 

(21d) 

Similar  results  hold  when  the  relationships  between  the  delays  are  different.  It  is  also 
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important  to  note  that  regardless  of  the  relationship  between  x  and  0,  pjkil)  =  Ujk(0  = 
Vjkil)=  Wjk(l)  =  0  for  I/I>1;  this  fact  will  play  an  important  role  in  determining  the 
proper  state  description  of  the  system  for  the  optimal  sequence  estimator.  Some  other 
useful  properties  of  the  correlation  parameters  are  pjkil)  =  PkjH)^  Ujkil)  =  Ukji-l)  and 
WjkH)  =  %jH)  =  -Wkji-l)  =  -VjkH). 

To  this  point,  as  Figure  9b  indicates,  we  have  made  no  assumptions  about  the  struc¬ 
ture  of  the  decoder  or  even  the  front-end  processing  that  will  supply  sufficient  statistics  to 
the  decoding  algorithm.  While  a  bank  of  matched  filters  matched  to  the  signature 
sequences  was  assumed  as  the  front  end  for  the  decision  feedback  techniques  examined 
in  section  3,  we  make  no  such  assumption  for  the  optimal  sequence  estimation  receiver 
of  this  section.  We  will  proceed  with  the  derivation  of  the  optimal  decoder,  now  that  the 
foundation  has  been  laid  and  the  appropriate  structure  of  the  decoding  algorithm  and  the 
front  end  of  the  receiver  will  become  apparent.  This  development  will  closely  follow  the 
derivation  of  the  optimal  MLSE  in  [19]  and  [26]  for  the  uncoded  ISI  channel. 

Beginning  with  equation  (13),  note  that  by  performing  a  modulo-jK"  decomposition 
of  the  index  i,  namely  i  =a(/)A'+P(t)-l,  we  can  write 

r(0=  E  ^p(o(^-a(O7*-0|5(o)^P(.)(^-«(O7fc-ep(i))>/^6P«  (22) 

i=-A/ 

This  assumes  that  the  K  users  transmit  2M/K+1  information  bits  each  in  the  time  interval 
of  interest.  In  other  words,  instead  of  the  initial  sum  of  equation  (13)  ranging  from  /=-« 
to  oo,  it  ranges  from  /  =—M/K  to  M/K. 

Next,  we  use  an  orthonormal  series  representation  of  r  (r)  using  a  basis  {gjfc(0}  • 

r(r)=  lim  .  (23) 

where 

=  X  « P(0.ifc(a(0)^^^  +  (24) 

i=-M 

eo 

9p(i).ifc(a(0)  =  I  ^p(o(^-a(O2t-0p(o)^P(o(^-«(O2fc-0p(o)5*(Oc/^  (25) 

'  oo 

and 

oo 

nk=  \n(t)glit)dt  (26) 

— oo 

which  is  nornitilly "distributed  with  zero  rnCtin  und  variunce  of  —  (iVQ/^)5(/r“‘/Tz), 
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where  6(A:)  is  the  Kronecker  delta  function. 

The  joint  conditional  probability  density  function  for  the  observations  given  the 
hypothesized  data  sequence  D  is 

exp  X  <7p(i).Jfc(o^(0)V^^  I  (27) 

\k=i  ^  J  t  ‘=-^ 

In  the  limit  as  y-^^,  the  logarithm  of  P(r^^\  . . .  \D)  becomes  proportional  to  the 

following  waveform  metric: 

_ 

/o(0)  =  - j  ^rity-  X  3po)(t-a(Orz,-0po))?p(o(^-a(O7),-0p(o  (28) 

— oo  i=-A/ 

By  expanding  the  square  we  get 

Jq(D)  =  -\  \r(,t)\^dt  +  2Re[  X 

j=-M 

eo 

•{Re  [i/p(,)(ot(i))]  J  r(O/o(^“Ot(O7)>-0p(,))Jp(i)(f“Ot(O7fc— 9|i(,))f/t 

oo 

-7m  [c/p(,)(a(i))]  J  r(r)/i(r-a(i)7f,-0p(i))5p(,)(?-a(O7i,-9p(i))iit)] 

Af  Af  ^  r  1 

-  X  X  j  Ue[^P(i)(a(O)]/o(?-a(Orz,-0p(/))-/m[^p(o(a(i))]/i(t-a(i)n-ep(,)^ 

i  zs-Mm  =-A/— “  ^ 

•  [d  p(„)  (a(m))]/o  (t  -a(.m)Ti,-Q^^„^  )-Im[d  p(„)  (a(m))]/ 1  (r  -a(m)ri,-0p(„)  ^ 

•?P(i)(^~ct(^ )7),-0p(i))  ?p(m)(^-oc('”)7fc-0p(m)) V^'frpo)  V^frpcm)  (29) 

Next,  define 

oo 

rop(,)(a(0)  =  J  r(O/o(^-a(/)7i,-0p(o)?P(.)(^-a(*)^i>-®PO))^^  (30) 

— oo 

and 

oo 

rip(.)(a(0)  =  J  r(O/i(f-a(O7i,--0p(o)?p(/)(^-a(O7'z,-0p(o)^*  (31) 

— >oo 

which  represent  the  outputs  of  a  pair  of  matched  filters  or  correlators  for  the  basis  tunc- 
tions  for  user  PO)  at  time  t  =  (a(i)+l)Tb  +  0p(,). 

By  expanding  (29),  and  then  collecting  the  appropriate  terms  we  get  the  following 
metric. 
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JoiDi)  =  7o(A-i)  +  2  Re[  Re  [ffp(.)(«('))]V£w  (^op(.)(a(0)  - 

^  r 

[^|3O-/)(^0  ~0)1  P[i(i)P(i-/)(Ot0  “0“"^0  )) 

/=1 

—  Im  [^p(,-/)(ot(i  — /))]  V p(,)p(,-/)(ct(i “/)“Oc(i))y "N/^ftpo-;) ) 
-/m[^/p(,)(a(i))]V£^tp(.)  (rip(i)(a(i))  + 

j^/m [^p(,-/)(oc(/— /))]  f/ p(op(/-/)(^0 

/:=! 

-;?e [^/p(._,)(a(/ -/))]  Wp(,)3(...,)(a(/-/)-a(/))yVJ?frp(.-/)  )]  -  2£i,p(o  (32) 

where  D,-  represents  the  data  sequence  up  to  time  interval  i,  and  L  is  the  smallest  integer 
such  that  for  every  L>L  we  have  pyjfc(a(L  ))  =  Ujkio-iL  ))  =  Vjki(x(L  ))  =  Wy^.(a(L  ))  =  0. 
We  have  already  seen  that  the  correlation  parameters  are  zero  when  I  a(L  )  I  >  1.  We  thus 
may  conclude  that  L  —  K—\. 

There  are  a  number  of  important  observations  that  can  be  made  from  the  metric 
given  in  equation  (32).  First  of  all,  it  is  additive  in  the  sense  that  the  global  metric  for 
evaluating  the  most  likely  sequence  of  super-code  symbols,  { /ip(.)(a(j))  }  ^-a/  is  written 
as  a  sum  of  stage  metrics.  Furthermore,  the  i'*  stage  metric  depends  only  on  the  super 
code  symbols  in  the  set 

S  =  {^ip(,•)(ct(^)),  iip(,_i)(cx(i  — 1)),...,  ^/p(,_A:+i)(ot(i  — /sT+l))},  (33) 

along  with  the  matched  filter  outputs,  rop(;)(a(/))  and  rip(,)(a(/)),  as  well  as  the  energies 
and  correlations.  As  in  the  receivers  which  operated  at  the  code  symbol  level,  with  the 
matched  filter  bank  at  the  front  end  of  the  receiver  matched  to  the  signals  {skit~jT^ic)}f 
it  is  again  possible  to  estimate  the  crosscorrelations  using  the  local  oscillators  and  code 
generators  which  are  assumed  to  be  synchronized  to  the  K  components  of  the  incoming 
signal.  We  can  also  estimate  the  energies,  by  averaging  the  outputs  of  the 

matched  filters  for  a  number  of  bits.  (The  number  over  which  they  would  be  averaged 
would  depend  on  the  rate  at  which  the  relative  strengths  of  the  users  is  varying.)  It  is  also 
interesting  to  note  that  rofc(0  and  rutO)  be  constructed  from  the  standard  matched 
filter  output  that  is  matched  to  the  signal  Sk(0  by  storing  successive  outputs  and  then 
linearly  combining  them  as  is  illustrated  in  Figure  12. 

For  any  user,  Jtc,  the  convolutional  encoder  defines  a  mapping  rul^  from  the  input 
information  symbol  and  present  state  of  the  encoder  to  the  code  bits,  D  IjtO)  “^d  Dikij). 
In  addition,  equation  (9)  defined  a  mapping  rule  from  these  code  symbols  to  the  complex 
super-code  symbol,  40')-  If  we  define  the  function  h  (•)  as  the  composite  mapping  rule 


/i  (.t-iTb-Qk)[SkO  -irb-Qk)+Sk(.t  -(i  +‘A)7fc-0*)] 


Rgure  12  Correlator  front  end  to  the  globally  optimal  MLSE  and  an  alternate  front  end  which  constructs  the 

same  correlator  outputs  from  the  basic  correlator  with  the  standard  signature/carrier  waveform  basis  functi 
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from  the  input  information  symbol  and  state  to  the  complex  super-code  symbol,  then  by 
substituting  the  information  symbols  that  define  the  state  of  the  encoder  in  for  the  state, 
the  following  expression  may  be  written: 

dkij)  =  h  (4C/).  40-1) . h(J-W+l) )  (34) 

Thus,  in  this  form,  it  is  clear  that  the  complex  super-code  symbol  depends  on  only  certain 
information  symbols.  Using  this  information,  it  is  easy  to  redefine  the  set  5  which  was 
defined  in  equation  (33)  in  terms  of  the  information  symbols  which  affect  the  i  stage 
metric. 

S  =  {/p(o(a(0).a.}  (35) 

Q  =  {/p(i-i)(ot(*  -1))>  ^p(i-2)(ct(i“2)),...,  I  p(,-h'a:+i)(oc(i -W^T+l))}  (36) 

Thus  it  is  now  apparent  that  the  system  may  be  described  in  terms  of  ^  states,  since 
the  information  symbols  are  binary.  Furthermore,  the  maximum  likelihood  sequence 
estimator  can  be  implemented  with  a  Viterbi  algorithm  operating  on  a  trellis  with  7^^ 
states  and  two  branches  per  state.  This  trellis  will  be  cyclically  time-varying  as  in  the 
uncoded  case.  [1]  Furthermore,  it  is  clear  that  this  trellis  reduces  to  the  trellis  derived  in 
[1]  when  the  constraint  length  of  the  code  is  one.  (uncoded  transmission  for  each  user) 

The  time  complexity  per  bit  decoded  for  the  MLSE  is  TCB  =  O  (2^^)  since  there 
are  metrics  which  must  be  computed  at  each  stage  of  the  trellis  and  one  intor- 

mation  bit  is  decided  at  each  stage.  Note  that  for  the  case  of  W  =  1,  the  TCB  calculated 
in  [1]  is  again  obtained. 

Obviously,  the  number  of  states  in  the  MLSE  grows  very  quickly  with  both  the 
number  of  users  in  the  system  and  the  constraint  length  of  the  codes  being  used.  In  fact, 
for  the  simple  4-user  example  used  in  the  simulations  of  the  decision  feedback  equalizers 
where  each  user  used  a  W  =  3,  or  4-state  code,  the  MLSE  requires  a  Viterbi  algorithm 
operating  on  a  trellis  with  2048  states! 

Figure  11  provides  an  example  of  a  2-state  code  that  may  be  used  by  each  user  in  a 
CDMA  system.  If  there  are  2  users  in  the  system  using  the  code  of  Figure  1 1,  the  MLSE 
trellis  will  have  8  states  and  is  shown  in  Figure  13. 

4.2  MLSE  For  Rate-P/Q  Convolutional  Codes 

Now  that  the  MLSE  has  been  derived  for  the  rate- 1/2  case,  it  is  not  difficult  to  gen¬ 
eralize  to  the  case  of  rate-F/Q  convolutional  codes.  The  general  metric  will  not  be 
derived  here,  but  instead  we  will  highlight  the  differences  from  the  case  derived  above. 


d2{k-\)  =  h(l2(k-l),l2(k-2y) 

C2k  =  ih(k)Mk-l)Ji(k-iy) 
di(k)  =  h(Ii(k),Ii(k-\)) 


Rgure  13  Trellis  for  the  globally  optimum  MLSE  for  the  2-user,  2  state 

code  example  in  Rgure  11.  The  trellis  is  cyclically  time- varying 

and  the  meaning  of  the  state  at  some  particular  stage  is  given  by  either  Cu  or  <j2jfc  for  users 

one  and  two  respectively  in  information  bit  period  k.  The  complex  super-code  symbol  djik)  labels  the 

trellis  branches  and  an  expression  for  the  mapping  between  information  symbols  and  super-code  symbols  is  given. 
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First ,  consider  the  case  of  a  system  where  each  user  operates  with  a  rate- 1/(2  convo¬ 
lutional  code.  The  function  Dkit)  will  again  have  to  be  constructed  from  a  set  of  orthogo¬ 
nal  basis  functions.  In  the  rate- 1/2  case,  this  was  accomplished  with  the  basis  set 
where  both  basis  functions  span  the  time  interval  [0,Th).  For  the  rate-l/Q 
code  case,  a  set  could  again  be  constructed  where  each  basis  function  in  the  set  spans  the 
entire  information  bit  period,  however  it  is  much  simpler  to  use  a  set  where  each  basis 
function  is  non-overlapping  in  time  with  the  others  and  occupies  an  interval  of  length 
Tt/Q.  It  is  worth  noting  that  this  same  set  could  have  been  used  in  the  rate- 1/2  case  to 
yield  the  same  super-code  function,  If  we  had  used  this  alternate  basis  set  for  the 
rate- 1/2  case,  the  resulting  metric  would  have  had  a  different  form,  but  it  would  be 
equivalent  to  the  one  derived. 

To  illustrate  a  possible  formulation  for  the  rate-l/Q  case,  let  D  ljt(/)  through  DQk{i) 
be  the  Q  outputs  of  the  convolutional  encoder  for  user  k  at  time  i  appropriately  mapped  to 
the  antipodal  set.  Then  the  super-code  function  D^it)  could  be  defined  as: 

Dicit-iTb-Qk)=Dlk(.i>git-iTb-Qk)+D2kiiygit-ii+yQ)Tb-Qk)  +••• 

...+  DQkiiygit-ii+(.Q -iyQ)Tb-Qk)  (37) 

where  git)  is  a  function  with  value  one  on  the  interval  nT<t<  in+l)T  and  zero  else¬ 
where,  and  Tt  =  QT.  Again,  the  functions  [skit)}  would  be  constructed  from  scaled  con¬ 
catenations  of  Q  versions  of  Skit).  To  derive  the  metric,  we  could  begin  with  (28)  and 
proceed  in  the  same  fashion  as  in  section  4.1.  The  number  of  states  in  the  trellis  used  by 
the  Viterbi  algorithm  will  again  be  ,  and  the  TCB  =  O  (2^^).  While  the  number  of 
states  in  system  is  not  affected  by  the  rates  of  the  codes  used  by  each  user,  it  is  worth 
keeping  in  mind  that  the  metrics  used  to  evaluate  paths  become  increasingly  complex  as 
Q  increases. 

For  the  general  rate-P/Q  case,  we  again  use  the  definition  k  =  log25  where  S  is  the 
number  of  states  in  the  single  user’s  encoder.  Thus,  k  represents  the  total  binary  memory 
order  of  each  user’s  encoder.  There  are  2^  input  hypotheses  to  test  in  each  information 
bit  period  for  each  user,  so  the  overall  trellis  will  have  2^  branches  per  state.  Further¬ 
more,  the  state  of  the  system  will  be  specified  by  (K-f-P)(/if— 1)+k  information  bits,  so  it 
will  have  states.  This  will  result  in  a  TCB  =Oi2'^^^^/P).  A  comparison 

with  the  TCB  estimates  of  the  DFE  structures  discussed  in  section  3  shows  how  vastly 
much  more  complex  the  MLSE  is  than  the  DFE  approaches  in  most  cases. 

Clearly  the  exponential  dependence  ol  the  TCB  on  the  number  ot  users,  the  number 
of  states  in  each  of  the  user’s  codes  and  P  makes  the  use  of  the  optimal  decoder  prohibi¬ 
tive  for  a  realistic  system.  It  is,  however,  an  important  receiver  because  it  represents  the 
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best  that  can  be  achieved  in  terms  of  sequence  error  probability,  and  it  will  provide  a 
good  baseline  on  which  to  judge  the  quality  of  suboptimal  schemes.  This  receiver  also 
raises  the  possibility  of  using  a  variety  of  sparse  searching  algorithms  like  a  sequential 
decoder  as  was  used  in  [24]  for  the  uncoded  case,  or  reduced  state  sequence  estimation 
techniques  like  those  proposed  in  [10]  for  the  uncoded  MUI  equalization  problem  and 
[27]  and  [28]  for  the  combined  equalization  and  decoding  problem  for  single-user  links 
suffering  from  ISI. 

4.3  Performance  of  the  Optimal  Sequence  Estimator 

To  illustrate  the  derivation  of  some  performance  bounds  for  the  MLSE,  we  will 

again  use  the  rate  1/2  code  example.  In  this  analysis  we  will  follow  fairly  closely  the 

analysis  which  appeared  in  both  [1]  and  [19].  In  keeping  with  [1],  we  consider  the 

decoding  window  to  range  from  the  index  -M  to  the  index  M,  which  corresponds  to  each 

of  the  K  users  transmitting  for  information  bit  periods  ranging  from  -M/K  to  M/K.  The 

goal  of  this  section  is  to  estimate  the  performance  of  the  optimal  sequence  estimator  by 

bounding  the  finite  and  infinite  horizon  error  probabilities  for  the  user  in  the  system, 

denoted  Pf  (/)  and  P*  =  lim  Pf  (i).  To  simplify  the  notation,  we  will  use  the  following 

Af— 

definitions: 


(38) 

di  =^p(o(oc(0) 

(39) 

pint  ^  Pp(i)P(m)(^0^^)“"^(O) 

(40) 

^im  ”  ^ P(i)P(m)(^('^)”^(0) 

(41) 

(42) 

^im  "  ^p(i)p(m)(^(^)”'^(0) 

(43) 

roi  =  rop(,)(a(0)  =  Re  [a?,•]•^/i^P.y  “  Rn  [dj\'^Vij  +  Zoi 

j=i-K+\ 

(44) 

rii  =  r,p(,)(a(/))=  ‘"’I"'  Re[dj]M^ij-Iin[dj]'MUij  + zu 

(45) 

where  zoi  and  zi,-  are  the  noise  statistics  at  the  output  of  the  matched  filters  for  the  basis 
functions  (t)op(,)  and  (l)ip(()  respectively  for  that  user’s  data  interval  a(i). 

We  next  consider  the  transmission  of  the  sequence  of  super-code  symbols,  d  = 
{d„}  n=-My  a  competing  sequence  corresponding  to  the  sequence  {r/„+e„} 
where  e  =  {c„}  n=-M  is  a  sequence  of  error  symbols.  Each  can  take  on  values  in  the 
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set  F=  {0,  ±2,  ±2y,  ±1±7}.  Next,  define  the  following  sets: 

B  =  [e:  e*(i)eF,  k=l,...,K,  e^iO^  for  some  i,k  }  (46) 

Aid)  =  {7:e£B,d+7eC}  (47) 

C  =  {d:dEhi{7})}  (48) 

where  h  (•)  is  the  mapping  rule  defined  by  the  code  from  an  information  sequence,  7,  to  a 
sequence  of  super-code  symbols,  d.  Since  this  mapping  rule  is  a  one  to  one  function,  it 
has  an  inverse.  If  we  define  the  information  error  sequence 

^  =  h-Hd  +  e)-7  (49) 

which  is  the  information  bit  error  sequence  corresponding  to  d  +  e  such  that  if  d  =  h(I), 
then  d  +  e  =  h(7+^).  This  allows  us  to  define 

Ak{d,i)  =  [e:  eeA  (d),  (50) 

so  A^(d,i)  is  the  set  of  admissible  error  sequences  which  affect  the  information  bit  of 
the  jt'*  user.  From  these  definitions,  it  follows  that  the  conditional  probability  of  error 
for  the  bit  of  the  jt"*  user  given  the  transmitted  super  code  symbol  sequence  d  is  given 

by 

Pfii  \d  sent)  =  P  [Joid -l-?)>7o(^)  I  d  sent }  (51) 

and  so  the  probability  of  error  for  the  i'*  information  bit  of  the  7"'^  user  may  be  written  as 

PkiO  =  Z  Pkii  I  d  sentyP^id  sent)  (52) 

dzC 

Since  an  expression  for  P^ii  I  d  sent)  is  not  known,  we  choose  to  bound  it  with  a  union 
bound. 

Pk(})  ^  Z  Z_  ^  (Jo(d+7)>Jo(d)  I  d  sentyP^id  sent)  (53) 

dzC  7eA^{d,i) 

It  can  be  shown  that  by  modifying  the  expression  given  in  equation  (28)  and  substi¬ 
tuting  the  above  definitions  we  obtain  the  following  form  of  the  metric 
M  f— '  Af  M 

Joid)  =  2Re  Z  yEb(Pe[di]roH'n[di]ru)  -  Z  Z  iRe[di]Re[d,„]pi,„ 

J  i=-M  m=-M 

+Im  \.di\\m[d,„]Vi,n  -  Re  [r//]Im[J„.] -  lm[di]Re  [d,nWi^)  (54) 

The  event  J^fd+e)  >  Jo(d)  may  now  be  expanded  using  equation  (54)  and  substi¬ 
tuting  (44)  and  (45)  for  ro/  and  ru  respectively.  After  some  algebra,  the  following 
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expression  for  this  event  is  obtained. 

MM 

2  2  {Re  [e,]Re[e,„]p„„  +  Im 

-Re  [ei]Im[e„.]Vi„.  -lm[ei]Re  [e,„]W/,„) 


Ai  1 

<  2yE‘i,  {Re  [ei]zoi  -  Im  [e/]z  i,) 

i=^M 

(55) 

Let  the  left  side  of  the  inequality  in  (55)  be  denoted  by  A^(F).  The  right  side  of  equation 
(55)  is  a  linear  combination  of  Gaussian  random  variables,  zqi  and  z  i,-.  We  know  that 

£[zo/]=^bii]=0 

(56) 

and  also  that 

,  Nq  - 

RlZQiZQin]  ~  2  P"” 

(57) 

Nq  - 

^  U  li  ^  1/n]  — 

(58) 

No  - 

R  [Z 0/  Z  l/n]  —  2 

(59) 

No  - 

E\Z\iZ0tn\~  2 

(60) 

As  a  result,  if  we  define  y  to  be  the  right  side  of  equation  (55),  namely 

M  f . T' 

y  =  X  {Re  [ei]zoi  -  Im  [e/]z  i,) 

(61) 

i=-M 


then  it  is  not  difficult  to  show  that  £[y]=0  and  Var[y]=2NQ  d?(e).  Next,  the  two- 
sequence  error  probability,  or  the  probability  of  the  event  given  in  equation  (55), 
becomes  the  probability  that  the  Gaussian  random  variable,  y,  is  larger  than  the  threshold, 
A^CF).  Thus 


P{jQ{d  +  e)  >  jQ{d)  \  dsent)  =  Q 


A^(e) 

^INoAHe) 


(62) 


We  next  define  the  following  efficiency  parameter  for  the  pair  of  sequences  separated  by 
the  code  symbol  error  sequence,  e,  as 


(63) 
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This  allows  us  to  rewrite  (62)  as 


P{jQ{d  +  e)>  JQ{d)  \  dsent)  =  Q 


V  No 


(64) 


so  r[^(e)  may  be  viewed  as  an  asymptotic  efficiency  relative  to  uncoded  BPSK  U'ansmis- 
sion  for  the  user  for  the  pair  of  sequences  d  and  d  +  e.  It  is  also  worth  noticing  that 
the  signal  generated  by  modulating  the  error  sequence 

AI/K  K  — — 

S(e,t)=  £  (65) 

i=-M/K  n=l 

has  energy 

oo 

I  15(7,0 1 1^=1  \S(e,t)\^dt  =  A^(e)  (66) 


This  implies  that  an  alternate  way  to  express  the  efficiency  parameter  defined  in  (63)  for 
the  pair  of  sequences  d  and  d+e  ^ 


4£'/,jk 


(67) 


which  is  analogous  to  the  form  of  the  distance  measure  in  [1]  for  the  uncoded  system. 

In  order  to  construct  a  lower  bound  on  the  probability  of  error  tor  user  k,  we  define 
the  following  minimum  efficiency  as 

ntnin(0  =  Lnf  inf_  Tlf(e)  (68) 

dtC  eeA^(d,i) 


and  so 


No 


(69) 


So  we  now  have  a  lower  bound  expression  for  P^iO  given  in  (69)  and  an  upper  bound 
expression  when  (64)  is  substituted  into  equation  (53). 

To  obtain  bounds  for  the  infinite  horizon  error  probability,  =  lim  Pff  (/),  we  may 

M— >«» 

use  the  same  argument  used  in  [1],  as  it  applies  equally  well  to  the  coded  case,  namely 
for  any  error  sequence  e  such  that  ej=e„^0  for  j^n,  the  sequence 

m<j 

^m+n—j  fn>j 


(70) 
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salisfies  A^(e')<A^(e)  or  equivalently  T]/f(e')<T]^(e)  or  else  it  would  be  possible  to 
construct  a  sequence  with  a  negative  energy.  Thus  we  may  conclude  exactly  as  in  [1]  that 
the  infinite  horizon  efficiencies  Ti;t(<i)  and  are  achieved  by  finite  length  error 

sequences.  As  a  result,  the  infinite  horizon  error  probability  for  the  k‘^  user  may  be 
lower  bounded  by 


Pk—^  [Tlt-(<^)  — Tlt'.Hiin]  *  Q 


(71) 


Similarly,  by  passing  (53)  to  the  limit  as  M  approaches  infinity 


^ifc- E  X_  P id  sent)’ Q 
dzC  ezAkidd) 


ill) 


Where  A^idJ)  =  lim  A^id,i)- 

In  the  high  signal  to  noise  ratio  regime,  the  terms  in  (72)  with  the  minimum 
efficiency  will  dominate  the  asymptotic  behavior  of  the  receiver.  As  a  result,  we  will 
refer  to  the  minimum  efficiency,  T\k,min  ^  the  asymptotic  multiuser  coding  gain  for  user 
k.  The  asymptotic  multiuser  coding  gain  is  an  efficiency  parameter  which  is  a  measure  of 
the  energy  gain  or  loss  of  the  system  relative  to  an  uncoded  BPSK  system  operating 
alone  with  an  energy  per  information  bit  of  Ei^k.  The  asymptotic  gain  relative  to  this 
reference  BPSK  system  in  decibels  is  lOlogioBt-.mm- 

In  the  limiting  case  where  there  is  only  K  =  l  user  in  the  system,  'r\k,mm  is  the 
asymptotic  coding  gain  of  that  user’s  code.  If  there  are  K  users  in  the  system  with  orthog¬ 
onal  super-signature  sequences,  then  Tj*  is  again  the  coding  gain  of  a  single  user  sys¬ 
tem  operating  with  the  same  code.  In  the  limiting  case  where  the  users  do  not  employ 
coding,  r\k,mm  is  equivalent  to  the  asymptotic  multiuser  efficiency  obtained  in  [1]  for  the 
optimal  multiuser  receiver  for  the  uncoded  system.  Thus  the  asymptotic  multiuser  cod¬ 
ing  gain  unifies  the  asymptotic  coding  gain  and  the  asymptotic  multiuser  efficiency 
parameters.  This  parameter  will  also  allow  the  study  of  near-far  resistance  for  this 
receiver,  which  is  the  infimum  of  over  the  energies  of  all  of  the  interfering  users. 

In  general,  the  computation  of  r\k,min  is  a  difficult  problem.  In  a  subsequent  report, 
we  will  examine  the  problem  of  the  computation  of  for  some  specific  cases  and 
will  provide  some  conditions  for  this  parameter  to  be  no  smaller  than  the  free  distance  of 
the  code  for  user  k.  In  this  report,  to  provide  some  direct  comparisons  between  the  per¬ 
formance  of  the  MLSE  and  a  representative  of  the  DFE  receiver  class,  we  will  use  a 
computer  simulation  for  some  two-user  cases. 
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Figures  14  through  16  show  the  results  of  a  simulation  of  a  two-user  system  where 
each  user  employs  the  4-state  convolutional  code  shown  in  Figure  10.  The  resulting 
super-trellis  used  by  the  MLSE  has  32  states.  The  MLSE,  integrated  DFE  and  conven¬ 
tional  decoder  (or  one-stage  integrated  DFE)  were  simulated  for  the  various  conditions  of 
each  figure.  In  each  figure,  the  cross-correlation  between  the  code  symbol  level  signa¬ 
tures,  pjk(.l)  is  defined  for  each  j,  k  and  1.  It  is  then  assumed  for  convenience  that  the 
information  bit  periods  have  delays  such  that  0*  =  for  each  user,  k.  This  assumption 
implies  that  the  relationships  defined  in  (21)  describe  the  correlation  parameters  for  the 
MLSE. 

In  Figure  14,  the  case  where  pi2(0)  =  0.3  and  pi2(-l)  =  0.3  is  simulated.  For  this 
degree  of  MUI,  the  MLSE  is  able  to  perform  at  the  single-user  performance  level  over 

the  range  of  Ei/Nq  simulated,  while  the  2  and  3  stage  integrated  DFEs  suffer  some  loss. 

■”2 

In  this  case,  the  conventional  receiver  suffers  about  1.5  dB  at  average  -  . 

Figure  15  illustrates  a  more  severe  MUI  environment  where  pi2(0)  =  0.4  and 
Pi2(-1)  =  0-4.  In  this  case,  the  MLSE  is  again  nearly  able  to  recoup  all  of  the  loss  that 
the  conventional  decoder  suffers  when  compared  with  the  performance  in  the  single-user 
environment.  At  Pb  ~  3T0”^  the  integrated  DFE  approaches  lose  about  1.5  dB. 

In  Figure  16,  the  same  0.4  channel  is  simulated  for  a  varying  near-far  energy  ratio. 
This  figure  shows  that  both  the  MLSE  and  the  integrated  DFE  approaches  appear  to  be 
near-far  resistant,  although  this  has  yet  to  be  shown  analytically.  Again  the  MLSE  is  not 
outperformed  by  any  of  the  other  structures  at  any  operating  point. 

It  is  worth  noting  that  all  of  the  work  in  this  section  has  been  based  upon  the  metric 
for  the  case  where  each  user  in  the  system  employs  rate- 1/2  convolutional  codes.  The 
expression  for  the  distance  and  asymptotic  multiuser  coding  gain  will  be  more  compli¬ 
cated  in  the  general  raxe-P/Q  code  case,  but  the  derivation  procedure  will  be  the  same. 
Thus  the  work  in  this  section  is  meant  to  illustrate  the  general  procedure  for  the  error 
analysis  of  the  more  complex  general  code  rate  case. 

To  summarize  the  results  of  this  section,  we  have  derived  the  distance  measure 
between  sequences  in  the  super-trellis  of  a  rate- 1/2  coded  CDMA  system.  We  then  wrote 
this  distance  in  the  form  of  the  asymptotic  multiuser  coding  gain,  This  parameter 

unifies  the  asymptotic  coding  gain  and  the  asymptotic  multiuser  efficiency  parameters. 
Finally,  a  simulation  illustrated  the  behavior  of  the  MLSE  in  a  variety  ot  situations  tor  a 
2-user  example. 


.conventional 


baverage 


Figure  14  Performance  curves  of  the  MLSE  and  integrated  decision  feedback  receivers 
for  a  2-user  channel  with  p^CO)  =  0.3  and  p^C-l)  =  0.3  and  equal  energies.  The  solid 
lines  show  a  single  user  system  (no  MUI)  with  and  without  the  rate- 1/2  4-state  convolu¬ 
tional  code.  The  two  and  three  stage  integrated  DFEs  are  shown  as  dashed  lines  and  the 
MLSE  is  shown  as  a  dotted  line. 
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Figure  15  Performance  curves  of  the  MLSE  and  integrated  decision  feedback  receivers 
for  a  2-user  channel  with  pi2(0)  =  0.4  and  pnC-l)  =  0.4  and  equal  energies.  The  solid 
lines  show  a  single  user  system  (no  MUI)  with  and  without  the  rate- 1/2  4-state  convolu¬ 
tional  code.  The  two  and  three  stage  integrated  DFEs  are  shown  as  dashed  lines  and  the 
MLSE  is  shown  as  a  dotted  line. 
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Figure  16  Near-far  ratio  performance  curves  of  the  MLSE  and  integrated  decision  feed¬ 
back  receivers  for  a  2-user  channel  with  PnCO)  =  0.4  and  Pi2(“l)  “  i^o  ~  2  dB. 

The  single-user  system  performance  level  (no  MUI)  with  the  rate- 1/2  4-state  convolu¬ 
tional  code  is  shown  as  a  solid  line  and  the  MLSE  performance  is  shown  as  a  dotted  line. 
The  two  and  three  stage  integrated  DFEs  are  shown  as  dashed  lines  and  the  conventional 
is  shown  as  a  solid  line. 
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5.  Conclusions 

In  this  report,  both  decision  feedback  and  trellis-based  multiuser  decoders  were  stu¬ 
died  for  links  which  use  CDMA  and  convolutional  codes.  A  variety  of  decision  feedback 
approaches  were  studied  and  some  simulation  results  were  presented.  These  results  show 
that  the  DFE  techniques  were  able  to  significantly  improve  the  performance  of  the  con¬ 
ventional  basestation  architecture.  In  most  cases,  the  performance  of  the  integrated  DFE 
was  superior  to  that  of  the  SCS-DFE  approaches  which  considered  the  equalization  and 
decoding  operations  separately.  The  drawbacks  of  the  integrated  DFE  architecture  are  its 
increased  complexity  and  decoding  delay  over  that  of  the  SCS-DFE  approaches. 

In  the  second  half  of  this  report,  two  trellis-based  receivers  were  examined.  The  first 
was  a  structure  which  attacked  the  MUI  and  the  decoding  operations  separately.  The 
second  approach  was  an  optimum  sequence  estimator  for  the  problem.  It  was  shown  that 
the  complexity  of  the  MLSE  depends  exponentially  on  the  number  of  users  in  the  system, 
the  number  of  states  in  each  user’s  encoder  and  the  number  of  input  information  bits,  P. 
This  high  complexity  points  to  the  use  of  suboptimal  trellis  based  approaches  such  as 
reduced  state  sequence  estimation,  [10]  and  [27],  and  sequential  decoding  approaches 
such  as  in  [24]  to  provide  a  complexity  versus  performance  tradeoff. 

Next,  an  outline  of  the  performance  analysis  of  the  system  was  given.  In  this 
analysis,  we  presented  the  asymptotic  multiuser  coding  gain  parameter  which  unifies  the 
asymptotic  coding  gain  parameter  of  coded  systems  with  the  asymptotic  efficiency 
parameter  of  uncoded  multiuser  systems.  Finally,  to  provide  a  direct  comparison  of  the 
performance  of  the  MLSE  to  the  DFE  approaches,  a  simulation  of  the  MLSE  and  the 
integrated  DFE  was  performed. 
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